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Introduction 


The present work is an attempt to develop a consistent framework for the 
description of non equilibrium phenomena on the basis of statistical mechanics 
(classical or quantal). For this purpose a phase space is constructed whose points 
(elementary events) are the possible outcomes of an experiment which consists 
of the “continuous” measurement of certain macroscopic observables. This phase 
space possesses a “‘natural’’ measure, corresponding to the Liouville measure in 
classical mechanics. In general the “natural”? measure is infinite. The equations 
of motion generate a group, parametrized by the time, of point transformations 
which preserve the “natural” measure. The constructed phase space of the ex- 
periment is hence formally analogous to the phase space of classical mechanics 
but contrary to this space it is composed entirely of macroscopically observable 
elements. The analogy to the classical phase space is further stressed by the 
existence of a “‘time reversal’ transformation which preserves the ‘natural’ 
measure. In the same way as the classical phase space is decomposed into energy 
surfaces and possibly further invariant subregions, the phase space of the ex- 
periment is decomposed into ergodic classes of finite measure, corresponding to 
different values of certain “integrals of the macroscopic motion”. One of these 
is the macroscopic energy. The integrals of the macroscopic motion characterize 
the macroscopic conditions under which a given experiment is carried out, in 
particular they determine the macroscopic equilibrium state which is possibly 
reached. The phase space structure of the experiment can be said to be equi- 
valent to a class of stochastic processes which are stationary, ergodic, and satisfy 
time reversal symmetry. Each stationary stochastic process of the class describes 
the fluctuations of the macroscopic observables in a given equilibrium state. 
The approach to equilibrium from an arbitrary initial state is described by the 
corresponding conditional stochastic process, which is determined by the sta- 
tionary process. 

The construction of the phase space of the experiment must be based on a 
prescription for the interpretation of (ideal) macroscopic observations on a micro- 
scopic level. It is wellknown that von Neumann [von Neumann, 1932 a] has 
given a prescription of how the quantum mechanical state or rather the density 
operator is changed when a non sharp observable is measured and that his point 
of view has been criticized by Liiders [Liiders, 1951] and others. It is sometimes 
erroneously proposed that the change of state by measurement is a sort of phys- 
ical procedure peculiar to quantum theory, which possibly could be considered 
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as the source of macroscopic irreversibility. In reality we are here dealing with 
a logical procedure which has a complete classical analogue, the construction of 
a conditional probability distribution. With this fact in mind we find it neces- 
sarv to attach ourselves to Liiders’ criticism of von Neumann’s prescription. 

The phase space structure of an experiment is developed in the first four 
sections and the rest of the paper is concerned with a study of the general 
properties of this structure. In particular questions concerning recurrence and 
approach to equilibrium are discussed. 


1. The energy-structure function 


For the purposes of statistical thermodynamics a mechanical system of a finite 
number of degrees of freedom is sufficiently characterized by its structure func- 
tion V (2; H); 0<H< oo, the extension-in-phase or the number of energy states 
below a certain value FE of the energy. The notation V (.; H) indicates the func- 
tional dependence of the structure function on the Hamiltonian. Equivalently 
the structure function can be given in term of its Laplace transform, the sum- 
over-states 


exp {— Y (p; Hy =| exp {—fBL}d;V(H; H); (0<B<-o), (1) 


0 


or its negative logarithm Y (6; H), the Massieu-Planck function, from which the 
thermodynamic properties of the system are directly obtained. The fundamental 
problem of statistical thermodynamics is the determination of the structure func- 
tion for mechanical systems of macroscopic dimensions. If the system can be 
considered as built up of almost independent parts the Massieu-Planck function 
of the total system is equal to the sum of the Massieu-Planck functions of its 
parts and for a system composed of a large number of similar and independent 
parts the asymptotic form of the structure function can be obtained in a rig- 
orous way by use of the techniques of the central limit theorem of probability 
[Khinchin, 1949], [Blane-Lapierre and Tortrat, 1956]. The first result of this 
kind is due to Maxwell [Maxwell, 1878], who derived the explicit form of the 
structure function corresponding to the kinetic energy of a system of particles 
having the same mass. From the exact form of the structure function he ob- 
tained its asymptotic form. 

It is an almost trivial fact that the existence of the sum-over-states for all 
positive values of f is a considerable restriction on the types of Hamiltonians 
considered in statistical thermodynamics. In particular it implies in the quantum 
mechanical case that the Hermitian operator 

exp {— PH} (2) 
in the state-vector space, having a finite trace, is compact (completely continuous) 
and hence has a pure point spectrum, which is either finite or infinite with 0 
as its only accumulation point.! Each eigenvalue is only finitely degenerated. 


ks The fact that (2) is compact may also be used as the starting point for a systematic deter- 
mination of the eigenvalues of H. : 
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In the quantum mechanical case the Stieltjes integral (1) is thus always reduced 
to a sum, in general infinite. It is important to recall these properties of the 
Hamiltonian when discussing the dynamics of the system, in particular when 
going to the limit of “infinitely large’ systems. 


2. Generalized structure functions 


For the purposes of non equilibrium thermodynamics it is sometimes neces- 
sary to consider generalizations of the thermodynamic functions corresponding 
to the equilibrium theory. On a statistical mechanical basis these generalized 
thermodynamic functions can be obtained from generalized structure functions, 
[Klein, 1952]. 

Let the macroscopic observables 


A=(A* x, A”) (3) 
be associated with the real phase functions, or Hermitian operators 
OHA a5 0), (4) 


We assume that the observables A can be measured together, so that the corre- 
sponding operators are compatible. Let 


i=(71, 2.3% (5) 


be n arbitrary intervals on the real axis. With the interval 7” and the ob- 
servable A” we associate a phase function I” (7) which has the value 1 in the 
phase points corresponding to a value of the phase function a" in the interval 7" 
and has the value 0 in every other phase point. Or we associate with the interval 
i“ the projection operator J” (2"), in the state-vector space, which corresponds 
to the interval i“ in the spectral resolution of the operator a. The product 


LOQ=FG)\ ie 120" (6) 


is again a function of the phase with values 1 and 0, or a projection operator. 
The structure function corresponding to the n observables A is defined as a 
function of the n-dimensional interval 7% (5): 


V (i; A)= SI (0) (7) 


where the symbol $ denotes integration over the phase space with the Liouville- 

measure h ‘dp, ... dq, h denoting Planck’s constant and f the number of de- 

grees of freedom, or $ denotes the trace operation. The interval function VGA) 

defines a measure in the n-dimensional Euclidean space R” which we denote by 
s mbol. 

igi ce tes expressions for the structure function (7) can be obtained under 

conditions analogous to those for the energy structure function." 


1 For a discussion see e.g. [Uhlhorn, 1960 a] where further references are given. 
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It is clear that the structure function V(.;A) represents only a part of the 
structure of the mechanical system, namely exactly that part which is determined 
by the observables A. Thus any question relative to the observables A can be 
answered with the aid of the function V(.;A), so that if the observables A are 
the only quantities accessible to observation, the relevant “‘phase space” of the 
system is the n-dimensional Euclidean space, with the “natural measure’, V (.; A) 
corresponding to the Liouville measure in the complete classical phase space. 
The characterization of the system has thus been considerably contracted, corre- 
sponding to the coarser description. 


3. Repeated observations and the corresponding structure functions 


The experimental situation which motivates the introduction of the structure 
function V(.; A) is the simultaneous measurement of the observables A at one 
single time. 

We are, however, very often interested in the comparison of observations made 
at different times. If for instance we claim that in the course of time a system 
reaches equilibrium we are referring to the behaviour of a time series of observed 
values of a macroscopic observable. 

From the statistical mechanical point of view we are therefore interested in 
all possible series of observed values corresponding to a given experiment which 
consists in the observation of a macroscopic observable at prescribed times. It 
is natural to call the totality of possible series of observed values the phase 
space of the experiment and to introduce a structure function corresponding to 
the given experiment. 

To make this clear we shall assume that the experiment consists in the deter- 
mination of the macroscopic observables A at two times t,<f,. The result of 
an experiment can then always be expressed in the form 


A (t,)€%,; A (f,) €i,} (8) 


where 7 and 7, are n-dimensional intervals (5). The structure function corre- 
sponding to observable events of the form (8) will be a function of the 2 n- 
dimensional interval i,x7,. In order to define it we need the mechanical equa- 
tions of motion. 


3.1. Classical mechanics 


In the classical case Hamilton’s equations generate a group of point trans- 
formations u(.) in the phase space Z 


Cr= w(t) lg, w(t)u(t)C=u(t+t’)l; (— oo <t, t’< + 0), (9) 


where ¢ denotes the phase point (p,, ...,q,). The point transformations Oe) 
preserve the Liouville measure d6=h ‘dp, ... dq, (Liouville’s theorem). 


' The symbol € denotes “‘is an element of”. 
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The mechanical system being closed, the equations of motion are invariant 


under time reversal or properly under reversal of the direction of motion. That 
is, there exists an involutory transformation t in the phase space 


S 


ttl =; (10) 


which preserves the Liouville measure, and is related to the group w(.) of 
kinetic transformations by 


tu (t)=u(—t)t. (11) 

By the algebra of classical observables we shall understand the system of all 

complex and bounded functions 2, y, ... on the phase space! with the algebraic 
operations defined by 

(Axt+my)(C)=Ax(C)+py(Z), (A, w complex numbers), (12) 


(ay) (C)=2(C)y (C). (13) 


The group of kinetic transformations w(.) induces a group of transformations 
U(.) of the algebra of classical observables onto itself [Koopman, 1931] 


U (€) x (C) == (u (#) C) (— co <t< +00). (14) 
Evidently U(.) is a group of isomorphisms of the algebra 


U (t) (Act+tpy)=AU (t)at+p (t)y, 
U (t) [zy]=[U (¢) x] [U (t) y] 


(15) 


which commutes with the *-operation, the taking of the complex conjugate, 
(Vignal = Ga. (16) 
Further the integral $ is invariant under U (.) 
SU (t= fdta(w(t)e)= Jdta(6)=S2,? (17) 


which is equivalent to Liouville’s theorem. 
The time reversal operator 7’ is defined by 


Tx (¢)=2 (tC) (18) 


1 Properly, we should use the expression essentially bounded with reference to the Liouville 
measure. The norm of an element x in the algebra, || z|| is defined as the essential least upper 
bound of the function | x (¢) |. The algebra of classical observables is in the mathematical term1- 
nology a commutative Banach algebra with a *-operation, the taking of the complex conjugate. 
The conventional notation for this algebra is L® (Z, d¢), Z denoting the phase space. 


2 Tf x is such that the integral Sx has a meaning. 
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for any classical observable. From the properties of the transformation T it 
follows that the time reversal operator is an involution 


f hel Wall (19) 
an isomorphism of the algebra of classical observables 
TQar py—Alaer ery 
T [xy)=[T'x] [Ty], (20) 
which commutes with the *-operation 
[Lal =e, (21) 
and that the integral $ is invariant under 7 
STu= {[dlu(rl)= [dln(0)=Sz. (22) 


Further the time reversal invariance of the equations of motion (11) is expressed 
by the commutation relation 


LO) Oa) 0. (23) 
If in particular we apply the time reversal operator 7’ to the phase functions 


a (4) which correspond to the macroscopic observables A (3) we obtain m phase 
functions 


TaN Tn cee) (24) 
which may be said to correspond to the macroscopic observables 
PA ee (A ee Ae (25) 


In (6) we defined the indicator function J (7) corresponding to values of the 
n-dimensional observable A in the interval (5). It is evident that 


T I (i) (26) 
is the indicator function corresponding to values of the time reversed observable 
TA (25) in the interval 7. 

The structure function corresponding to the experiment discussed above is now 
defined in analogy to (7) 
V (ay, ta ty, ty; A) = S(O (t,) (0,110 (te) Z (22)1.- (27) 


From the invariance property (17) it follows directly that the structure function 
depends only on the time difference so that we may write 


V (ty, 495 t, te3 A= Vii tgtes Bain A), (28) 
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Further, from the same invariance property 
Vo (ty; 493 ; A)= V(t, 4; —#; 7A), (29) 
and from the time reversal invariance (22) and the commutation relation (23) 
Vin tas a; CEA) = Vg (te, C5 tA): (30) 


There is no problem in extending the discussion to an arbitrary finite number 
of measurements of the observable A e.g. at the times 


bi lg = et. (31) 
The structure function defined in analogy to (27) 
V orp Ms ts) a2 oes AS | bd (2) es (Oe Ge) (32) 
will obviously be invariant under a translation of the origin of the time scale 
V Aba pccese tsb beewenty, Cyc 677A) saa wirg ter Ugnites vpbigoA); (33) 
and satisfy the time reversal symmetry relation 
Ve eee tebe geste eh A) = Villig, wo Vs bin sag hg): (34) 


The structure functions are connected by the consistency relations 


V (41; aeaehe UK is Tins b, an ty ffs ty; pas V (21; - A Ope t,, Sees) beat? A), (35) 
Wak toy ee) pes t,, to, eee bye; A)= V (5s eons Un; be, sverertn bis A) 
which follow immediately from the fact that 
es (36) 


3.2. Quantum mechanics 


The equations of motion in the Schrodinger picture generate a one-parameter 
group of unitary transformations in the state-vector Hilbert space corresponding 


to the mechanical system 
u(0)p=y; u(thu(t’)p=ult+t)yp (—co <t, f < + 60) (37) 
<u (t)p|u(t)y>=<plp> or w(t)’ w(t)=1. (38) 
Here yp is an arbitrary state-vector and <|> denotes the scalar product. 


1 Symmetries of the type (29) have sometimes been incorrectly interpreted as expressing time 
reversal symmetry. In reality they are only a reflection of the fact that the equations of motion 


generate a group of measure preserving point transformations. 
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The mechanical system being closed, the equations of motion are invariant 
under time reversal. Following [Wigner, 1932] we express the time reversal opera- 
tion as a conjugation operation J in the state-vector space. A conjugation opera- 
tion is an antilinear, isometric and involutory transformation of the vector space 


onto itself. That is : 


J(Apt+ugy=sAJypt+hto (yp. y arbitrary state-vectors) 


(A, w arbitrary complex numbers) Pie 
JIylJ p= vl (40) 
v= (41) 
The time reversal invariance of the equations of motion is expressed by 
J u(t)=u(—i) J, (42) 


or equivalently by the statement that the Hamiltonian is “real”, to be under- 
stood in the sense that it commutes with the conjugation operator J. 

For the purposes of statistical mechanics it is more convenient to use a re- 
presentation of the kinetic group u(.) analogous to Koopman’s in classical 
mechanics, that is to use a Heisenberg picture, and a similar representation of 
time reversal. Taking x, y, .... to be bounded linear operators! in the state-vector 
space we define U (t) by 


U (t)z=u(t)*z2u(t) (—2<t< +00). (43) 


Evidently U(.) is a group of isomorphisms of the algebra of all bounded linear 
operators 


U (t)(Axt+pyl=AU (t)a+u (t)y 


? ‘ ; (44) 
U (t) [wy] =[U (t) x] [U (t) y] 


which commutes with the *-operation, the formation of the Hermitian adjoint 
[U (t) a]* =U (t) 2°. (45) 
Further the trace-operation $ is invariant under U (t) 


SU (i) x= S2«2 (46) 


* The class of all bounded linear operators on a Hilbert space is a non commutative Banach 
algebra with a *-operation. The extensive mathematical theory of operator algebras is presented 
e.g. in [Dixmier, 1957]. 

* The trace of a bounded non negative operator is always defined but possibly equal to + co, 
The trace of an arbitrary bounded operator is defined as a sum of four terms corresponding to 
the decomposition of the operator in non negative parts «=a—b+ic—id, and has meaning if 
no expressions of the form co— co occur in the sum. 
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We define the time reversal operator 7’ by 
Txe=(JxJ)*. (47) 


From the properties of the conjugation operation J (39), (40), (41) it follows 
that the time reversal operator 7’ is an involution 


dh Me (48) 
an anti isomorphism of the algebra of bounded operators 


TP(Autmy=ATat+uTy 


(49) 
T [xy|=[Ty] [T 2}, 
which commutes with the *-operation 
ala". (50) 
and that the trace operation is invariant under 7’ 
STzx=S x. (51) 


Further the time reversal invariance (42) of the equations of motion is ex- 
pressed by 


T u(t) =u(t), (52) 
or by TU (t)=U(-4)T. (53) 


Applying in particular the time reversal operator T to the operators a (4) 
which correspond to the macroscopic observables A (3) we obtain n operators 
Ta which correspond to the macroscopic observables 7’ A. 

So far we have profited from the formal structural analogy of the algebra 
of the bounded functions on the phase space (the algebra of classical observables) 
and the algebra of the bounded linear operators on the state-vector space (the 
algebra of quantum mechanical observables).1. In the definition of the structure 
function corresponding to the experiment discussed above we must however take 
into account the non commutativity of the algebra of quantum mechanical 
observables and the special nature of the quantum mechanical process of meas- 
urement. a 

To begin with, we must consider the decomposition of the unity which corre- 
sponds to the oparators a (4) which are associated with the macroscopic ob- 
servables A (3). This decomposition is given by the projection operators J (.) 


1 Most observables are of course represented by unbounded functions or operators, €.g. energy, 
aiginonbudis angular momentum, but can be associated with bounded functions or operators, 
e.g. the I (i) (6). On the other hand, it is not necessary that every bounded operator corresponds 


to an observable. 
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(6) which are defined for each n-dimensional interval. As according to the quantum 
theory of measurement an observation can distinguish between an finitely many 
or at most denumerably many elementary events (see e.g. [von Neumann, 1932 a}), 
we can write 


I (i)= > I (a), (54) 


where I(x) denotes the projection operator corresponding to the n-dimensional 
interval which contains only the point «, and the sum extends over all points 
a in the interval 7. As was stated J(«)=0 except for at most denumerably 
many points so that the sum in (54) has a well defined meaning. 

The structure function corresponding to (27) is now defined by 


Vidi, U0) Pe Mia oes Lae) (55) 


1 Et, Hg Elg 


where VC Ons be tg AY SHO) Line (hae (eel 
x{[U (t,) L (o)] [U (te) L (xe) IF. 
From the invariance property (46) of the trace operation $ it follows as in the 


corresponding classical case that the structure function depends only on the time 
difference so that we may write 


From the same invariance property and the relation 
S(xy)=S (yx) (x, y arbitrary operators in the state-vector space) (58) 
we obtain Vig (44). 023, 8; A) = Vo .. t3 —t A) (59) 
I follows further, from the time reversal invariance (51) and the commutation 
relation (53), that 
Vi (hy tg 3 TAY = Vette, 0: tA). (60) 
As in the classical case there is no problem in extending the discussion to 


an arbitrary finite number of measurements of the Bhesrvaiice A. The structure 


function corresponding to measurements at the times (31) is defined in analogy 
0 (55); ; 


Vi (tas ntssotags. ty. aay bisa) oe > 0 VV (oct pncegcet ume. ethate 2A (61) 


a, El, apeix 
where Vee. son 5 Oey bys oes ye A) = Sa LU: (t,) L (a,)] «.. [UC (t,) f(a, ) 1h" x 


x {[U (6) Z (o%)] ... [U (ty) I (org) ]}- 


' See the note to equation (29), and [Groenwold, 1957]. 


(62) 
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The structure function (61) will obviously be invariant under a translation of 
the origin of the time 


Vets tt ot ANS VG, ett ea A) (63) 
and satisfy the time reversal symmetry relation 
VW iigd wate Ges ae acre te OA) eA (ert ts A). (64) 


The structure functions are connected by the consistency relation 


FOr nla, te ty ce, teed tel Val tytn been ti. oe beg A) © 
UR a ieee Ar ails, ames AN ee (a earns, Ads), 
_In particular we obtain by “contraction”? of V, the structure function (7) 
Vo(t;, B"; t; A)=V, (RY, 143 A)= V (@,5 A). (66) 
The consistency relations (65) and (66) follow directly from the fact that 
> I(a)=1 (67) 
xen 
is the unit operator in the state-vector space. 
3.3. The structure functions corresponding to macroscopic observations 
The structure functions 
Veen EAU ae uae) (68) 


defined in the classical case by (32) and in the quantum theoretical case by 
(61), (62) are according to their definitions non negative, additive set functions 
in the (k-n)-dimensional Euclidean space, and hence define a measure in that 
space, the “natural measure” in the “phase space of the experiment’. It follows 
from the consistency relations (35), (65) that 


V (4, seis) ip Op aloes ee t,, see ty, teeny ties A)< V (ty; A); (69) 


Evidently the “natural measure” of the whole phase space of the experiment 
will be infinite if the classical phase space has infinite Liouville measure or if 
the state-vector space is of infinite dimensionality. . 

The fundamental properties of the structure functions (68) are the consistency 
relations (35), (65), the stationarity, that is the invariance under translation of 
the origin of the time, (33), (63), and the time reversal symmetry, (34), (64). 
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Whereas structure functions derived from the classical theory satisfy the “con- 
traction relation” 


V gs chy Maa Ry aay oe Cl eon te te Gate be A) (70) 
= VG, hie ae iy et i ele A), 


this is in general not the case for structure functions derived from quantum 
theory, unless the ‘‘contraction relation’? (70) reduces to the particular case of 
a consistency relation. This failure is obviously a reflection of the irreversible 
character of the (ideal) quantum measurement, that is it is a logical consequence 
of the fact that gain of information is always connected with a certain loss of 
information. However, we are here discussing measurements of macroscopic ob- 
servables and it is intuitively clear that for such observables the irreversible 
nature of the (ideal) measurement must be completely negligible as otherwise 
the values of the macroscopic observables could not be used as objective char- 
acteristics of macroscopic systems, and hence there would exist nothing such as 
macroscopic observables and as a consequence no macroscopic systems. As we 
have no reason to expect that quantum theory is incompatible with macroscopic 
physics we must assume that the “‘contraction relation’ (70) is approximately 
valid also for structure functions derived from quantum theory. This gives us 
a necessary connection between the operators a corresponding to the macroscopic 
observables A and the Hamiltonian H of the mechanical system, the operators 
a must ‘almost’! commute with H. Though this statement has an intuitive 
significance it is not easily formulated quantitatively. 

A quantitative measure of the commutativity of the operators a with H can 
be obtained from a comparison of the density operators 


exp {¥ (8) BH} (71) 


and >, I («) exp {Y (8) -—fB H} I (a), (72) 


aeRn 


the first characterizing a canonical ensemble with temperature (f)~', where x 
is Boltzmann’s constant, and the second characterizing the same ensemble after 
the observation of A, without record of the result of the observation. It follows 
from Klein’s lemma, [Klein, 1931] that the entropy corresponding to the second 
density operator is greater than the entropy corresponding to the first one, and 
that the entropies are equal if and only if the density operators are equal, that 
is if and only if the Hamiltonian H is compatible with all the projections I («). 
If the operators a shall correspond to macroscopic observables it is therefore 
necessary but certainly not sufficient that the difference between the two en- 
tropies is an unobservable fraction of the total entropy. Even if this condition 
is physically meaningful it seems difficult to use it as an analytic characteriza- 
tion of operators corresponding to macroscopic observables. There is no difficulty 


An attempt to characterize operators, whose commutator with the Hamiltonian is “small” 


can be found in [van Kampen, 1956]. Here the emphasize is on the fact that the macroscopic 
observables are approximate constants of motion. 
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in showing that the condition is fulfilled asymptotically for systems consisting 
of a large number of similar and independent parts if the operators a are sum- 
functions. 


4. The phase space of an infinite series of observations 


We shall consider an idealized experiment which consists in the “continuous” 
measurement of the observables A (3). By this we mean that the observables 
are measured at constant, “‘short’”’ intervals of time 


Oe aie Oh eee ee yee (73) 


and that the series of measurements is infinite in both directions of the time 
axis. The last assumption is convenient for the mathematical description and 
is physically irrelevant. Using the time interval between two successive meas- 
urements, f), as unit we associate each observation with an integer k. The 
outcome of an experiment is thus an infinite series of points in the n-dimen- 
sional Euclidean space: 


ORs legen Cbs Bon need yuboe (74) 


The totality of possible series of the type (74) will be called the phase space 
(2 of the experiment, and each possible outcome a phase point w. The value 
a, is the k:th coordinate of the phase point w. 

The class of all finite dimensional structure functions 


V tie ttn woe Sieg lo Gor Lycee et as A) (75) 
satisfying the consistency relations (35) (65), induces a measure j(.) in the 
phase space (2. The procedure is the same as in the theory of stochastic pro- 
cesses, see e.g. [Kolmogorov, 1933], except for minor changes dependent on the 
fact that the measure defined by the structure functions is not normalizable. 
We shall not give the details but only remark that the procedure starts with 


the construction of the class F of so called measurable subsets of 2. These 
measurable subsets are generated by all subsets of ( of the type 


{ow | , Et, Wey Cady 00s Wise EUs KS. (76) 
The «-measure assigned to the set (76) is 
w{o|w, Et, ..., Ore Etisnh = V (tr, > ren L ..., +k; A). (77) 
Let a point transformation wu be defined on Q by 
(WU), = @x+1- (78) 


It follows directly from the stationarity of the structure functions (33), (63), 
that the transformation w preserves the w-measure in Q. That is, if # is a set 
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. . . . =) aye). . . ; 
belonging to the class J, its inverse image, wu, consisting of all points w 
such that wm€F, belongs to F and has the same p-measure 


u(u? F)=p(F). (79) 


Evidently the point transformation wu describes the kinetics of the experiment 
discussed above. We have thus obtained a phase space representation of the 
experiment which is completely analogous to the phase space representation of 
a mechanical system in the classical theory. However, the phase space repre- 
sentation of the experiment applies both to a classical and to a quantal descrip- 
tion of the mechanical system. In both cases the equations of motion generate 
an incompressible stationary flow in the phase space, determined by the measure 
preserving point transformation uw. Hence it is possible to apply the wellknown 
methods of classical statistical mechanics: ergodic theory etc. to series of ob- 
servations in systems, independent of the kind of mechanics by which the sys- 
tems are described. It should be noted that in the classical case the replace- 
ment of the phase space of the system by the phase space of the experiment 
is not a complication of the description. For obviously the description in terms 
of the phase space of the experiment is in general much coarser than the original 
in terms of the phase space of the system, the first description beeing dependent 
only on the macroscopic observables A. It must be noticed that we have evaded 
here the fundamental and very difficult problem of actual derivation of the 
structure functions from classical mechanics or from quantum mechanics re- 
spectively. 

We shall make the further assumption about the macroscopic observables A 
(3), that each has a definite parity with respect to time reversal, that is 


DAT =O (u)AN, (O(n)—= 44 or =i m= 1, 2, %)- (S80) 


With this assumption the time reversal symmetry of the structure functions (34) 
(64) can be expressed as 


V (0 bas see Glare t,, BOO < tp; A)= V (ty; Sry tas t,, sees ty; A) 


= V (tay, cant Uns — ten oe gab) Sia: 
where 47 is the “time reversed”? n-dimensional interval 
a€0t if Oa=(0(1) a’, ..., O(n) x”) Er. (82) 
We define the point transformation t in Q by 
(t@)-~=O0@,=(0(1) we, ..., O(n) w%) (83) 


and find directly that t is involutory, and from (81) that t preserves the measure 
j(.). Further t is connected with the kinetic transformation u by 


1 


GUM Aho (84) 
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H Ihre incompressible flow in the phase space 


For the description of series of observations in a mechanical system we have 
developed the structure 


OS Ue a ere (85) 


consisting of a phase space Q with a class of measurable subsets F and a meas- 
ure w(.), in general with «(Q)—= co, a measure preserving point transformation 
wu, generating an incompressible flow in Q, and finally a measure preserving, 
involutory transformation t connected with w by the relation (84) which ex- 
presses the invariance of the fundamental equations of motion under time re- 
versal. That is we have a structure formally analogous to classical mechanics. 
In this section we shall discuss some questions related to the flow in Q gener- 
ated by wu, in particular the classical problem of recurrences. 

It was recognized already by the founders of statistical mechanics, in partic- 
ular by Boltzmann and by Gibbs, that the physically relevant constituents of 
the classical phase space are not the individual phase points but those subsets 
of the phase space which have a positive Liouville-measure, though the physical 
reasons could be properly understood only after the discovery of the quantum 
theory, and though a logically consistent formulation was possible only after the 
invention of the mathematical measure theory by Lebesgue.1 The measurable 
subsets (the so called Borel sets) of the phase space may be said to form the 
class of possible events with reference to all possible observations of the system. 
Two events differing from each other by a set (an event) of Liouville-measure 
zero are physically equivalent. However, a macroscopic observer can realize not 
all possible events but only a certain subclass, even if he can repeat his experi- 
ment an arbitrary number of times. We shall call this “‘coarser’ subclass of 
all possible events the observable events. It is clear from our previous analysis 
that the observable events of the experiment considered above constitute exactly 
the class F of measurable subsets of the phase space Q of the experiment. 


The transition time 


If A is an arbitrary subset of the phase space Q, belonging to the class F 
we consider a phase point which enters* the set at the time ¢=1, and then 
remains in A until the time t=k+1. We shall say that the transition time of 
the particular phase point is equal to k. We want to characterize the set A 
relative to the flow in the phase space generated by the point transformation 
u. This could be done for instance by associating each possible transition time 
with a probability. For the computation of these probabilities we shall mainly 
use some ideas due to Kac [Kac, 1959]. For the first considerations it is not 
essential that the flow generated by w is incompressible and that w is invertible. 

A point w€Q has the transition time k exactly if? 


1 As regards Boltzmann’s point of view see 'Ehrenfest, 1911], in particular note 170. The 
second proof of the recurrence theorem, given by Gibbs [Gibbs, 1902] in particular, anticipates 
measure theoretical methods and is free from the irrelevant topological considerations of the 
original proofs due to Poincaré and Zermelo. 

2 IT apologize for using the word point in a double sense. 


3 The set C A is the complement, “not A’’, of A in QQ. 
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@oeeA or equivalently o€ECA 

uwEeEA x - weu lA 

uk weEA or equivalently w€u “A (86) 
Ue pe Nee - wen Cake 


The set (belonging to F) of all points which enter A at t=1 and leave A for 
the first time at t=k+1 is thus simply the intersection of the sets occurring 
at the right hand side in (86) 


CA Ante Ais CAS (87) 


We shall assume that 
O<piCAnuw’ As<c. (88) 
The ‘“‘probability”’ p(k) for a transition time equal to k is defined by 


peetCkia At—utC Ane AQ Ate) CA (89) 


that is, p(k) is the ratio of the measure of the set of all points entering A 
at ¢=1 and leaving A at t=k+1 for the first time to the measure of the set 
of all points entering A at t=1. It is evident that 


HACIA AU Ai AN ae 


=4{CANw Annu” A= p{CAnw An 2a or ny be 

and that the sets CAnwtAn-:-nw*A (91) 
decrease monotonically with k to 

c an nT ON: (92) 


that is the set of all points entering A at t=1 and remaining for ever in A. 
Hence we find 


k=c0 v=0o 
2. p(k) u{CAnu*A}=u{CAnu*A}—n{Ca n a” NPS (93) 
<u{CAnw'A} 
k=00 
or Pe HE or oh (94) 


STAN q\ A, denotes the intersection of the two sets A, and A,, that is the set of all points which 
are contained in both of the sets. 
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where p(co) is the probability of an “infinite transition time” 
piece) a tCAna “A= nin n wy NY. (95) 
y=1 


that is the “probability” that a particle enterning A at ¢=1 will remain in A 
for ever. 

Consider now a point which is in A at the time t=0 and moves on in A 
exactly until the time t=. A point w remains in A, k units of time precisely when 


wEeA or equivalently wE€A 
uwmed - woeutaA 


u*-twEA or equivalently w€u * PA (96) 


ukweECA ,, Ss weu “CA. 


The set (belonging to F) of all points in A which remain in A & units of 
time is the intersection of the sets occurring at the right hand side of (96) 


A iar aA ees > Aa CoA (97) 
In addition to (88) we shall assume that 
0<p(A)<o. (98) 


The ‘‘probability” q(k) that a point in A will remain in A exactly k units of 
time is defined by 


a (ku (A= (Ata Aten! A mueGA), (99) 


that is q(k) is the ratio of the measure of the set of all points in A at t=0 
which leave A at t=k for the first time, to the measure of the set of all points 
in A at t=0. Evidentally 


pina -Aun nia “UA Nu” C Ay 


(100) 
=pid iu Ata A} yan us Nie ure Ay 
and the sets Nie Nee ay aN (101) 
decrease monotonically with k to 
Taro, (102) 
v=0 


that is the set of all points in A at ¢=0 which remain for ever in A. Hence 
we find 
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> af c) {A} = {A} — pt n Ne Ue ING, (103) 
k=00 ; 
or S g(k)=1—q(e) <1, (104) 


where qg(cc) is the probability that a point in A leaves A “‘‘after an infinite 
time’, that is remains for ever in A 


q (co) {Ay=n{0 Ay. (105) 


= 


We shall now see how the “probabilities” p(.) and q(.) are related. Here 
for the first time we make explicit use of the measure preserving property of w: 


ie ON SaaS arbitrary 69): (106) 
It follows directly that 
BALAI oO coats Nae iN 
=fiu Aw An ohu Ate OCA (107) 
=p i Au Afi” Aiea “CAL 
and it follows from the condition (106) that 
pie Aen te “Alu CAlaatA he fha A tase, et loe: 


Combining (107), (108) and the definitions (89) and (99) we find the desired 
connection between p(.) and q(.) 


k) w{CA Nu" A}=[9 (hk) —¢ (b+ 1] p fA}. (109) 


We introduce the generating functions 


k=00 
P@)= p(kje"* (220), (110) 
k=0 
k=00 
and Q(z)= > g(kje* (20), (111) 
k=0 


which are welldefined according to (94) and (104). From (109) follows the rela- 
tion between the generating functions 


(2 CUNO oN y= ep AeA Nine CA (112) 
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Using (106) we find 
MICA Nw * AS=p{Anu*CA}=q(l) pw (A), (113) 
expressing the ballance of “incompressible fluid’: the measure of the sets of 
points entering A in the unit of time is equal to the measure of the sets of 


points leaving A in the unit of time. 
Hence we can write (112) as 


P(z)q(l)=Q (2) L-e]—-g()). (114) 
The limit for z | 0 of (114) exists as a consequence of (104) 
(OG (1) = O10) 0-59 (1) =9 (1); (115) 
and as in view of the assumption (88) q(1)>0, we find 
k 


P(0)= > p(k)=1 or equivalently p(co)=0. (116) 


k=1 


io 2) 


The trivial meaning of this being that, the flow being incompressible, almost 
every point entering A will leave A after a finite time. We now find the de- 
rivative of P(z) at z=0. 


Les ; Q(z 1l-e _ —_ Q(9) 


P’ (0)= hi lim . 117 
i ae ene eo = atl) fae 
or using the definitions of P(z) and Q(z), (110) and (111), 
ee Q(0)_1~q(e) 
ttrans (A) = k) k= --— = —— ; 118 
ee re ea) wee 


that is the mean transition time ttrans(A) for the set A is equal to the “‘prob- 
ability” that an arbitrary point in A does not remain for ever in A, divided by 
the ‘“‘probability”’ that the point leaves A in the unit of time.! 

We recall the formal definition (105) of g(°c°c) in terms of the measure of 


the set 


nua (119) 
y=0 
contained in A. The sets 
(ut kew A u-’ A} (120) 
y=[e p= 


1 An alternative derivation of this result is given in the appendix. 
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form an increasing sequence and have all the same measure as (119). Define 
H=00 v=00 r 
— -v 1 C 
Ao US fiver: (121) 
p=0 v=p 
It follows from the construction that A, is w-invariant 


pene aie (122) 


and contains the set (119). On the other hand the two sets have the same 
measure and are hence equivalent. If and only if A contains no subset which 
is equivalent to a w-invariant set, we have according to (105) 


q(co)=0 or equivalently yu (A,)=90. (123) 


If (123) holds we shall say that the set A is transitory. Hence we have for 
a transitory set the mean transition time 


1 (A) i. 
= = 24 
terans (A) q(l) w(Anw* CA) oe 
and for an arbitrary set 
bea (A) == (125) 


FL INS AN) 


Unfortunately the relation (113) cannot be used in general to derive the dis- 
persion of the transition time. 


The recurrence time 


By the mean recurrence time of the set A one would understand the mean 
transition time of the complementary set. To obtain a proper definition we 
consider the set 

v=00 


u”’A (126) 


(—} 


p= 


of all points of © which reach A at some finite time. We assume that the 
-measure of this set is finite. Hence the w-invariant set 


U=00 V=CO 


Ate en (127) 


M=0 v=pu 


has the same finite j-measure and is equivalent to (126). Up to equivalence 
relative to sets of fu-measure zero we have then, 


A,cAch’ (128) 


1 U F, denotes the union of the sets F,, that is the set of all points contained in one at least 
v 
of the sets F). 
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that is the set A is contained in the w-invariant set A*, of finite measure, 
and A* is the “smallest” w-invariant set containing A, whereas A, (121) is the 
“greatest” u-invariant set contained in A. The mean recurrence time of the set 
A is defined as the mean transition time of the complement of A in A*, that 
is of A*1 CA. Hence according to (118) 


* 1—4q.() 
brec (A = ttrans = == ayn a 
(A) = free (A* 9. C A) = BE (129) 
It is easily seen that 
ee) 
RON NSS NY 


Ge (1) = (1) (130) 


The relation (116) applied to p,(.) is an expression of Poincaré’s recurrence 
theorem: any point of A recurs in a finite time if A is contained in a invariant 
set of finite measure. If in particular the complement of A in A” is transitory 
(123) we have g.(c°)=0 and the mean recurrence time is 


, —_w(A*)—m (A) 
eA 


(131) 


We note that (131) is true whether A itself is transitory or not. 
In the conventional terminology the transformation u is called metrically 
transitive or ergodic if it has only trivial invariant sets, that is if 


iw Kh’ =A’ implies w(A’)=0- or w(C A’) =0. (132) 


In this case every set of positive finite measure is transitory. However, in the 
particular structure (85) corresponding to the observation of macroscopic ob- 
servables the transformation w will hardly ever be ergodic. 


Regenerative sets 


In (106) the condition of incompressibility was introduced, and the subsequent 
discussion was based on this condition. However it is of some interest to re- 
place this condition by the following. The set A will be called regenerative! 
if (88) and (98) are satisfied and if 


p(k)=q(k) (k=1, 2, -..), (133) 
an Ne Ae ieee Aha COON pk) a Ne 1i2 ee) (184) 


The meaning of (133) is that for a point the “probability” of leaving the set 
A in &k steps is independent of the past of the point, and (134) expresses a 
weaker form of incompressibility, the stationarity of the probality distribution 
of the transition times. We now replace (107) by the assumption that the set 
A is regenerative. The relation (107) holds also under the condition (134) and 
gives rise to 


1 The notation is chosen in analogy to the concept regenerative process, see [Feller, 1950]. 
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p(k) {CA nut A}=p(k) {uw *A}—p(k+1) u tA. (135) 


Defining the generating function P(z) by (110) we get from (135) after an 
elementary calculation 


P(e=—2P O_o p(y =v) —p (136) 


and the whole probability distribution p(.) is determined by p(1). In particular 
we find 


P(0)= > p(k)=1, or equivalently p(%o)=q(cc)=9, (137) 
k=1 
= : A (138) 
Ly (i= k) k= ——=bhpeas (A), 1 
(0)= 5 p(k) k= = tea 


k=00 a 1 a il 
5 7 (8) [k—teaas (AF =P (0) - [POP =F a aye TAN TAY ano 
k=1 


Hence the expression for the mean transition time is the same as (124) corre- 
sponding to the condition (107) and a transitory set. It is obviously possible 
to combine the conditions (107) and (133), (134). It must however be observed 
that the expression (139) for the variance is valid in general only if the set A 
is regenerative. Further it must be noticed that the fact that A is regenerative 
does not imply that the complementary set is regenerative. Hence the distribu- 
tion of the recurrence time cannot be obtained in a simple way. 


Variance of the transition time 


We consider a set A with the mean transition time firans(A) given by (118), 
(125). In general the transition time will vary considerably among the points 
which enter A. That means that points entering A simultaneously will leave A 
at very different times and a certain stirring of the phase points in A (in Gibbs’ 
sense) occurs. The variance of the transition time is a measure of the stirring 
produced by the transformation w in the set A. If this variance is finite it may 
be formally obtained from the generating function P(z) (114) 


k=00 
os p(k) [% — trans (A) a 2 q(*) k 
vy q(1) 


Re trane\) (Gerans (A)+ 1}, (140) 


but this expression is in general of no practical use as we cannot estimate the 
value of 2 q(k)k. Intuitively looking at the physical interpretation one would 
=1 


be inclined to assume that 
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M 
8 

=) 
S 
= 
// 
~~ 
5 

o 

i=} 

a 
ea 


(141) 


k=1 
and this holds for “tubes of flow” (see appendix), but cannot be proved for 
arbitrary sets. 
Product flow 
If the mechanical system which we are considering consists of two (dynamically) 


independent parts the phase space structure (85) will be the direct product 
of two structures 


OE oi pals), tz, b= 1/2): (142) 


in the sense that each phase point in ( is represented by a pair of phase 
points, one in Q, and one in Q, 


= (@;, Ws); (143) 
the invariant measure (.) in Q is the product measure 
pt (Ay x Ag) = fy (Ay) fg (Ag); Ay E F,, Ay € Fy, (144) 
and the transformations uw and 7 are given by 
UW = (UW, Uz We), (145) 


TW =(T1 4, T: Wo). (146) 


We shall determine the mean transition time and the mean recurrence time 
for a product set A=A,xA,. We shall first consider ¢(1), (99), the “probability” 
that an arbitrary point in the set will leave the set in the unit of time. By 
definition 
uf{Anu'tCa} 

wtAy 


(147) 


q (1) 


and the definitions of q,(1) and q.(1) are analogous. It is easily found by a 
formal calculation that 


q(1) = 4 (1) + 2 (1) — % (1) G2 (1), (148) 


but a direct probabilistic reasoning is simpler: A point w= (q,, m2) in A=A,x aN 
remains in A in the first unit of time exactly if w, remains in A, and q@, re- 
mains in A,. These events are independent and it follows that the probability 
of the combined event is 1—q=(1—q,)(1—q,). This is exactly (148). Hence we 
find the mean transition time 


1—q(c) (149) 
G1 (1) + G21) — % (1) % (1) 


ttrans (Ay x jake) a 
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and the mean recurrence time, see (129), 


A xh.) 2a seloe) a (AT) oe (AB) = pry) oa (As) 
tnoe ( 1* 2) ala) sem, (dl TP TIAN) 


(150) 


The result can be extended without difficulty to the product of an arbitrary 
number of phase spaces, that is to systems consisting of an arbitrary number 
of independent parts. We recall that the time interval t, between successive 
observations was chosen as the unit of time. Hence the mean transition time 
is in arbitrary units equal to tg: ttrans(A). Assuming q, (1) and q, (1) to be of the 
order of t, 

q(l)=Yity+O(%); (= 1, 2), (151) 


we can replace (148) by 
Y=yi1tY2t (1) (152) 


bd ce > 
and for an arbitrary number N of “components’ 


vy=N 
y= > ywto(lb=Npt+o(1). (153) 
v=1 
With the notation 
ei by (Ay) (154) 
v=1 [ly (Ay) 
the mean transition and the mean recurrence times can be expressed by 
-~ 1—q(c2) 
terans (11 A,) = ee OL) (155) 
\v=1 N Y 
per ae) seve alles Jol aya! 
and ne (11 A} — NG iT a +o(1). (156) 


Noticing that the geometric mean (154) {<1 one obtains from (156) an im- 
pression of the behaviour of the mean recurrence time, frequently called the 
Poincaré cycle /, for a product set of a large number of components correspond- 
ing to independent systems, as far as q.(°co) can be estimated. For the par- 
ticular case of recurrences in the angular variables of a system of harmonic 
oscillators an expression equivalent to (156) has been derived in [Hemmer, 
Maximon and Wergeland, 1958] by a different method originally developed by 
Smoluchowski [Smoluchowski, 1916]. 


Asymptotic distribution of the transition time 

The discussion of product flow under the additional assumption (151) could 
also have ben based directly on the continuous analogue of equation (109). To 
obtain this analogue we rewrite (109) 
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k=, k=— i= 
0 1 to bo ml 
Se. Fas | >" g(k)- oa] +2 (157) 
and obtain for fixed ¢ in the limit t, | 0 
d 
vil—PW)=— Qi) (158) 


where P(t) denotes the probability of a transition time shorter than t, and Q(t) 
denotes the probability that an arbitrary point in the set considered will leave 
this set before the elapse of the time t, and evidently 


y= Q' (0). (159) 
It is easily found that the set A is regenerative, 
Dt)=Q(t)_ exactly if Pt)=1—e¢”, (160) 


that is if the transition time is exponentially distributed. 
For an arbitrary product flow we have by a simple probabilistic argument 


1— Q(t) =[1— Q, (¢)] [1 — Q, (é)] (161) 


and a corresponding formula for several components 


1- Qi =I (1- Q,(t)], (162) 


Consider for simplicity the particular case of equal distributions 
Q,(t)=Q,(t) (v=1,...,N) Q1(0) =. (163) 
It is then easily found that the transition time divided by the number N of 


components is asymptotically exponentially distributed for large numbers N. 
The result follows from the fact that 


a(x) [ a, (5) = [re wy to(y)] > 


> WOOL ent (£ fixed, N > oo). (167) 


The same asymptotic form of the distribution is obtained also if the Q,(¢) are 
different, if the difference is “not too great”. Hence a product set of many 
components is asymptotically regenerative. We note that the direct product of 


regenerative sets is again a regenerative set. 
In principle it is possible to determine the distribution of the recurrence time 


of a product set by an analogous method but the combinatorial difficulties are 
extreme. 
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Recurrence and macroscopic observables 


It will usually be possible to decompose the phase space {2 (85) whose 
w-measure is infinite, into a sum of disjoint w-invariant, and not further de- 
composable subsets of finite j-measure, each characterized by the numerical! 
value/s of some macroscopic integral/s. In particular it will often be natural 
to assume that a measurement (possibly successive measurements) of the macro- 
scopic observables A will determine the macroscopic energy. 

Any macroscopic observable associated with the experiment discussed above 
(section 4) is represented by a function on the phase space which determines 
the corresponding structure function. For certain types of macroscopic observ- 
ables in large systems it is possible to obtain an asymptotic expression for 
the structure function with the aid of the central limit theorem. Let in particular 
x(w) be a function on Q representing a macroscopic observable, such that the 
structure function corresponding to the pair 


ay (w)=x(w), %(w)=2(u' w) (168) 


is approximately of the Gaussian form 


Vi (dx, dy; t)= (169) 
_¢ = me _ exp fe = m)? +(y— mu aba Me) (w—m) (y— " ae 
276° Vio (t) a (l= pit) 
Here @(t) is the correlation coefficient 
O<o(t)<1; lim o(f)=1; 0 (—-t)=0(t), (170) 


t30 


2: es ae Ge 9 

and o° is of the order of the number of ‘‘independent, similar’ components of 
the system. Note that the parameters C, m and o” are independent of the time. 
We are interested in the mean recurrence time for a set of the form 


{w|& <x (w)—m<é}. (171) 
We start with the particular case &;= — co, and assuming the existence of the 
= 1 
limit y= lim q(t) (151) we have 
10 bo 
nl j l a+y*?-2o(t)x 
lim — | | $$ exp| Meahats Z 
ay SictVicedie | le Solace 7 
— r<é y>§ a_i Pee : = gs ae 
ile fie re ee “ we LZ) 
V2a07 stale gs 
B<E 


Introducing the distribution function of the normalized Gaussian distribution 
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tl ieee 
D(a) = = exp {- A | dy, (173) 
an “ 
we can rewrite (172) as 
1 E x 
fo (£2) 40(2 
t : o — 9 
y = lim - — — = 
to © (") (174) 
o 


where 0=o(t). And it is found that 


y=lim a : (175) 


er 
oO 
: efile ; 
To see this we set z= —7=—— and obtain 
oV1—o? 
eer 
Bes 
_ wee ‘E-oV1—9z 
| o| 22.) 4,0 (2) ge | oi) 0'(° ~oV1-e Ja: (176) 
ee, ke Vl=—¢ ep este oe 
After integration by parts we find that (176) is equal to 
= he a 
(ae 0’ (*) a1 —o) (177) 
I o 


which gives the desired result (175). For a finite interval (&,&) we obtain in 


the same way 
rw 0 Gh) 
V1l=p) J Sly eet ed (178) 


li — 
ee o(-0 
oO Oo 


which includes (175) as a particular case. Assuming the existence of a suffi- 
cient number of derivatives, we have 


o()=1+ 0 (0)t+4Fe" (0) +o(#), 0 (0)<0, (179) 
and hence 
lim bag? Os V -+6"(0) if?p (0)=0, (180) 
t1o = 
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otherwise the limit does not exist. That is the derivative of the correlation 
coefficient must be a function continuous at t=0 (compare (170)). This on the 
other hand is exactly the necessary and sufficient condition for the existence 
of the “velocity” 

d 


t(D) 5 


dt 


which, as is easily seen, has the correlation coefficient 0” (t). It is wellknown 
that many probabilistic models of physical processes violate this condition and 
are invalid for small times. In particular this is so for models of the Brownian 
motion type. This fact was strongly emphasized already by Smoluchowski [Smo- 
luchowski, 1916] and again discussed in connection with the linear theory of ir- 
reversible processes [Casimir, 1945]. 

From (178) we readily obtain the mean transition time for the set (171) 


ea! 
tirans == SS [1 q(c)], (181) 


ea) 8) 


and the mean recurrence time 


[1 — qe (o)]. (182) 


Yoo) 


We recall that o”* is proportional to the number of similar, approximately in- 
dependent components, that is proportional to the ‘“‘size’” of the mechanical 
system. With this fact in mind it is an easy task to estimate the values of 
tirans and te, for intervals (171) of physical interest. In particular we obtain 
the mean recurrence time of a single value &=&, 


beg = - = exp (2 = i= We (eoure) i: (183) 
Veoi(0) a Ac 


According to the Cramér-Khinchin representation theorem for the correlation 
coefficient 


+00 
o(t)= f exp {iA} d, F(A), lim F(a)=0, lim F(a)=1, (184) 


where the spectral distribution F(A) is uniquely determined by o(t). In parti- 
cular, in our case 
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e' (0)= [ iAd, F(A)=0, (185) 
BOSON Biel (186) 


Hence we find that 


Ve @_Uraray, Be 


7 Te 
has the character of an average frequency, and the expression for y can be 


written 
Pi ef? | & 


rar eek (189) 


In general we can not expect that the sets of the type (171) are transitory. 
On the other hand it is not possible to discuss this question without a knowl- 
edge of all the structure functions corresponding to arbitrary series of measure- 
ments of the observable represented by « (168). Assuming a Gaussian form for 
all these structure functions we can obtain a sufficient condition by reference 
to a theorem of Grenander [Grenander, 1950], which states that a Gaussian 
stochastic process with continuous correlation g(t) is metrically transitive exactly 
when the spectral distribution F(A) is continuous, that is if there are no 
discrete frequencies. Hence we find in particular that all sets of the form (171) 
are transitory if the function F(A) is continuous, but this condition is obviously 
not necessary. 


6. On ergodicity and irreversibility 


The phase space structure (85) consisting of the space , the class F of 
measurable subsets of ©, the invariant measure j(.) the kinetic transformation 
u, and the time reversal transformation t, has been constructed as a framework 
for the description of all possible observable events occurring in the series of 
measurements of the observables A (3). The measurable subsets of (2 correspond 
uniquely (within y-equivalence) to the observable events. In order to meet the 
needs of logic the class F has to be closed under the elementary set operations 
(taking the complement, formation of denumerable unions and intersections; in 
the terminology of integration theory, F is a sigma-field). As the phase space 
structure (85) has the same fundamental properties as the phase space of a 
classical mechanical system, the developments of section 3.1 are applicable also 
to Q. The (measurable) complex functions of the phase represent the macro- 
scopic observables associated with the experiment considered. The linear oper- 
ators on phase functions which correspond to w and r are defined as in 3.1 
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Ux(w)=2(uo), (190) 


T x(w)=x (to). (191) 


A subclass of all complex functions of the phase is formed by the functions 
invariant under U. This class is obviously linear, that is closed under addition 
and multiplication by complex numbers, and invariant under the time reversal 
operation 7’ as 


Ux=2z implies UT2—=Tx. (192) 


The U-invariant phase functions represent the integrals of the macroscopic 
motion. The assignment of numerical values to all the integrals of the macro- 
scopic motion, that is to all invariant phase functions, is equivalent to a pre- 
scription of the macroscopic conditions under which the series of measurements 
of the observables A is carried out. As a consequence of the macroscopic nature 
of the observables A the number of functionally independent integrals of the 
macroscopic motion will usually be small. That is, we expect that the macro- 
scopic constraints can be characterized by a small number of real parameters: 


B=(B,, ..., Bn). (193) 


We have given here a somewhat heterodox formulation of the ergodic prob- 
lem in statistical mechanics. We recapitulate the essential points. We consider 
as given the structure of the mechanical system and the phase functions or 
operators which represent the macroscopic observables 4. Hence the phase space 
structure (85) corresponding to all possible observations of the system through 
measurements of the observables A, is determined. The integrals of the macro- 
scopic motion are the parameters which determine the macroscopic conditions 
under which a particular observation series is carried out. The ergodic problem, 
as we consider it, is the coordination of macroscopic observables and integrals 
of the macroscopic motion. By definition a particular experiment is always car- 
ried out with the values of the macroscopic integrals of the motion fixed. The 
values may of course be unknown to the observer, and the purpose of his ex- 
experiment may be a determination of them. 

The integrals of the macroscopic motion and the measure u(.) determine a 
conditional measure ?(.) in Q corresponding exactly to the conditional measure 
induced by the Liouville measure on the energy surfaces in classical mechanics. 
The technical details of the construction of the conditional measure m?(.) will 
not be given here. The problem is intimately connected with the probabilistic 
concept “‘sufficient statistic’, see [Halmos and Savage, 1949], and also [von 
Neumann, 1932] (in particular p. 617). For the description of the experiment 
discussed above we may thus restrict ourselves to one ‘‘ergodic surface’ of the 
phase space, corresponding to fixed values of the integrals B of the macroscopic 
motion. The induced measure y?(.) on this ‘‘surface” is u-invariant. According 
to its construction the decomposition of the phase space Q into “ergodic sur- 
faces” is minimal, in the sense that no further decomposition into invariant 
parts can be obtained. In the following we shall restrict ourselves to one par- 
ticular “ergodic surface’. For simplicity we use the notation (85) for the corre- 
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sponding induced phase space structure and assume that, the induced measure 
is finite 
O14 (Q) = co, (194) 


This condition is satisfied in all applications of physical interest, as the macro- 
scopic energy will always be one of the integrals of the macroscopic motion. 
The finiteness of w(.) is then a consequence of the basic assumption contained 
in equation (1). 

We observe that the reduced phase space structure corresponding to one 
“ergodic surface” will contain the time reversal transformation t only if the 
integrals of the macroscopic motion are invariant under 7’. 

We can now apply Birkhoff’s ergodic theorem which states that under the 
given conditions the time-average of any phase function exists and is equal to 
the corresponding phase average 


ee J x(w) u(do) 
aes eee 195 
ae N x=0 EMD Hw (Q) 8) 


We recall that every phase function x(w) represents a macroscopic observable 
associated with the experiment discussed above and that the phase average is 
taken over a “‘surface’’ determined by given values of the integrals of the macro- 
scopic motion. However Birkhoff’s theorem is an asymptotic statement. It is 
physically important to know how fast the convergence of the time average to 
the phase average is for particular macroscopic observables. Questions of this 
kind are conveniently studied with the aid of the spectral resolution of the 
operator U restricted to the Hilbert space of all phase functions whose square is 
integrable with respect to the measure y(.). 


6.1. The spectral resolution of U. Classification of observables 


In the Hilbert space of the phase function with integrable square, conven- 
tionally denoted by L*(Q, yw), the scalar product and the norm are denoted by 


celyy= | 2) 4(0) u(do), || || =<a| a+. (196) 
Q 


It was first pointed out by Koopman [Koopman, 1931] that the transformation 
U defined by (190) is unitary. This is a simple consequence of the fact that the 
measure (.) is invariant under the point transformation uw, and that this trans- 


formation has an inverse. 
Assuming that the integrals of the macroscopic motion are invariant under 
time reversal we find by the same argument that the transformation 7’ defined 


by (191) is unitary. 
~ . is unitary it has a spectral resolution of the form, see e.g. |Hopf, 1937], 


27 
U =f exp {id} ab, (197) 
0 
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where H;, the spectral distribution of U, is a projection valued function on the 
interval [0,22] satisfying 


E, By =Epmna, Lo=9, He =l1, Ea-0=lh, (198) 


and the integral (197) is to be understood in the sense that for arbitrary x, y 
in 1? (Q, 1) 


2 
<a|U|y>= J exp {iA} di<e| Bly, (199) 
0 


the integral (199) being an ordinary Stieltjes’ integral. 
A conjugation operation J can be defined in L*(Q, uw) by 


J x(w)=«(@). (200) 


Both the transformations U and T are real in the sense that they commute 
with the conjugation operation J. For the spectral distribution of U this com- 
mutation implies the symmetry 


J B,J =1I— Een-a+0- (201) 
The commutation relation for U and 7’, expressing time reversal invariance, 
NT = ae (202) 

gives rise to a relation similar to (201) 
TH, T =I = Ben-as0- (203) 


The spectral distribution HL, of U induces a decomposition of the Hilbert 
space into three mutually orthogonal subspaces, whose projections 


Et, B°, Be, B+ Be + He = T, (204) 


are compatible with #,, and are such that 
E,= E° E+ E* E+ EM Ey (205) 


is a decomposition of #, into an absolutely continuous part, a singular continu- 
ous part, and a singular discrete part (the eigenvalue part) with respect to 
the Lebesgue measure in [0, 27].! H% is the projection on L?(Q, 4) onto the 
subspace spanned by all eigenvectors of U. If x is an arbitrary element in 
L* (Q, w) we have the decomposition of the distribution function on [0, 27] 


<x| Hy |x) = («| B* B,|2> + (x| B® B,| x) + <e| B B,| a> (206) 


* The proof of the decomposition into a continuous and a discrete part is given in most 


texts on functional analysis. A proof of the decomposition into a singular and an absolutely 
continuous part can be found in [Uhlhorn, 1960 b]. 
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into three parts, the first distribution function being equal to the integral of a 
density function, the second being continuous with derivative zero and the third 
being a jump function, the jumps corresponding to eigenfrequencies of U. The 
different parts of the spectral distribution may be said to express various de- 
grees of irreversibility (see below). It is an easy task to show that the decom- 
position (204) is compatible with the operation J and with the operator 7’, hence 
the orthogonal decomposition of the Hilbert space corresponding to (204) is in- 
variant under J and under 7’, and the symmetry relations (201) and (202) hold 
for each part of the spectral distribution if in the formulae the unit operator I 
is replaced by the projection onto the corresponding subspace. In particular one 
finds that the eigenfrequencies occur in pairs (A, — A). 

The ergodic theorem of von Neumann can be stated in the form: For arbitrary 
x in L*(Q, w) the time average exists and is equal to a U-invariant element 


n=N-1 


lim — > U*x=E xox. (207) 
N—-co N n=0 


Here £9 is according to the definition of H, equal to the projection onto the 
subspace of eigenvectors of U corresponding to the eigenfrequency 0, that is 
the subspace of U-invariant elements. This subspace is also J- and T-invariant 
and in particular contains the normalized element e 


e(o)=n(Q)"*, <ele>= (208) 


which spans the subspace of constant phase functions. The phase average of an 
arbitrary element x is evidently equal to 


e<e|a>. (209) 


In the following we give a classification of the observables according to their 


> 


“degree of irreversibility’. 


Ergodic element 


Let an element x be called ergodic if the time average (207) and the phase 
average (209) are equal, that is if 


H.ot=e<elxy, (210) 


or equivalently if its spectral distribution || (Za —e> <el) a has no jump at 
2=0. We have already assumed that the transformation u is metrically transi- 
tive, or equivalently that the only invariant functions are the constant func- 
tions, hence in our particular case every element is ergodic. The criterion for 
ergodicity may be written 

Byo=|e> <l. (211) 


Ergodic event 


i said to be ergodic if the 
An event A (subset of the phase space) will be sai ! 
corresponding indicator function I(A, w) (1(A, w) =1 if wmEA, =0 if o€ A) is 
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an ergodic element. We find immediately that the complementary set of an er- 
godic set must be ergodic and transitory according to the definition of section 
5. Hence the simplified expression for the mean recurrence time applies to an 
ergodic set. 


Weakly mixing element 


Let an element x be called weakly mixing if «—e<e|«> does not belong to 
the subspace spanned by the eigenvectors of U, that is 


(H°+ EB) x=xz—e<el|x), (212) 


or equivalently if its spectral distribution ||(#,—|e> <e|) 2||? is a continuous 


function of A in [0, 2z]. 
Criteria 
1. An element aw is weakly mixing if and only if 
n=N-1 


] 
lim — > |<al]U"|x>—<2led <e|z>|=0. (213) 
N->0oo N n=0 


2. An element x is weakly mixing if and only if there exists a set J(x) of 
positive integers such that J(x) has density zero and such that if 7 is restricted 
to be outside J (x), then 


lim’ <a| U"|2) =Car ey <e| ay. (214) 
Ae) 


To say that J(x) has density zero means that the ratio of the number of those 
integers between 0 and »—1 that belong to J (x) to the total number, that is 7, 
tends to zero as n tends to infinity. 

The proofs of these criteria can be extracted from [Halmos, 1956]. For the 
following definitions compare also [Hopf, 1937]. 


Weakly mixing event 


An event A will be said to be weakly mixing if the corresponding indicator 
function J(A,@) is a weakly mixing element. In particular we have, using the 
second criterion above 


te (Aa AY | kell 


215) 
pes ATES Q 
nerd HI ) HI ) 
Evidently the complementary set of A is weakly mixing with A. 
Strongly mixing element 
Let an element 2 be called strongly mixing if 
a <a| 0" |= al ey <e|a>. (216) 
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Sufficient condition 


In order that an element x be strongly mixing it is sufficient that 2—¢ <e| a> 
belongs to the absolutely continuous part of the spectral distribution of U, that is 


E* x=2—ee|x), (217) 


or equivalently that the spectral distribution of x, ||(H;—|e> <e|)2||? is abso- 
lutely continuous, i.e. has a density. 


Strongly mixing event 
An event A will be said to be strongly mixing if the corresponding indicator 


function J(A,@) is a strongly mixing element. Applying (216) to J(A, .) we find 
the condition 


OT iat F ae (218) 


i) u(Q) 
illustrating the asymptotic independence of the events A and u "A. Obviously 
the complementary set of A is strongly mixing with A. 

Evidently a strongly mixing element (event) is weakly mixing and a weakly 
mixing element (event) is ergodic. We note in particular that the defined classes 
are linear. Further, in physical applications we are not likely to meet spectral 
distributions of the singular continuous type and hence the concept weakly 
mixing will not occur in our discussion of the connection between irreversibility 
and the properties of the spectral distribution of U. 

Let A be a strongly mixing event and A, an arbitrary event. In analogy to 
(218) we find 

peat Gs A) tA) (A) 


1 a ner a Se: 219 
Co ao) m(Q) u(Q) oo 


Here we may interpret 


HO) gna pay) 


as the stationary probabilities of the events A, and A, and 


Pikega 2 Aja 


as the probability of observing the event A, at t=0 and the event A at t=n. 
The relation (219) simply expresses the fact that the dependence between isa 
and A is lost for large time differences. Such expressed the probabilities refer 
to fluctuations in a system at macroscopic equilibrium. However, (219) rewritten 


in the form " 
_ P(A, Nw A) _ nia 220 
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expresses the approach to the equilibrium distribution of the conditional prob- 
—n 
SS The existence of at least one, and hence of two, non 
1 . . : . > 
trivial, mixing events, A and CA, hence gives rise to a certain stirring in Gibbs 
sense, for introducing an initial probability distribution P)(.) in Q by 


ability 


PON.) = fale (NAG) tata Geum ag) (A, arbitrary -<O,/,, j.2.9), (221) 
we have PA (ASL os (222) 
and from (219), (220) the asymptotic form for any such distribution 


lim P,, (A,) =P (A,) (223) 


no 


is seen to be the uniform distribution. If A and hence CA are only ergodic 
one has instead of (223) the weaker statement 


n=N-1 
eae P,,(A,) =P (A,). (224) 


Noo n=0 


One could be inclined to think that the existence of strongly mixing events is 
necessary for the irrreversible appearance of physical systems. That this is not 
so can be seen immediately from a study of the eigenvalue spectrum of the 
transformation U. 


The eigenvectors of U 


We have assumed that the point transformation w is metrically transitive, or 
equivalently that the vector e (208) is the unique (up to an arbitrary phase 
factor) normalized eigenvector of U corresponding to the eigenfrequency 0. U 
may have additional eigenvectors 


Ux,=exp {iA}a,, <a|a>=1 (A+0, real). (225) 


As was noticed by Koopman [Hopf, 1937] it follows from the distributive prop- 
erty (44) of U that the function 


Xj (w) x,(w) = 0 


— 


226) 


is invariant under w and hence equal to a positive constant. Hence the corre- 
sponding normalized eigenvector can be written 


e-exp {70;},  (0,(@) real). (227) 

It follows further that if 7,,A, and 4 are eigenfrequencies of w 
U exp {t6,, +7 6,,} =exp {i (A, +A,)} exp {0,,+16,,3; (228) 
UT exp {i0;}=exp {—1tA}T exp {i6,}. (229) 
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As direct consequences of (228), see [Halmos, 1956], the eigenvalues cannot be 
degenerated ae with two eigenfrequencies A, and A, also 4,+A, (mod 22) and 
4,—A, (mod 27) are eigenfrequencies. Hence it follows from (229) that 


T exp {i6,}=exp {—i6,} (230) 
or §(tw)=—O6(m) (mod 2z), (231) 


an expression of the trivial fact that the ‘‘angular variables” @ (@) are odd with 
respect to time reversal. 

For the physical application it is important that the eigenfrequency spectrum 
of U is either of the form 


Ap=md, (Mod 22%) (Ap> 0; n=0) 1, +2; ...) (232) 


where /, is the smallest positive eigenfrequency, or the eigenfrequencies form a 
point set which is dense on the unit circle, or the eigenfrequency spectrum 
contains only the point 0. In the first case the angular variables are all gener- 
ated by a single function 


6:,(@) =n 6,,(@) (mod 22). (233) 


If the corresponding eigenvectors span the space L*(Q, mu), the system is ob- 
viously equivalent to a one-dimensional harmonic oscillator in classical mechan- 
ics. This case is not likely to occur in a phase space structure generated by 
macroscopic observables. On the other hand a sufficient condition for the second 
case to occur is the existence of at least two rationally independent eigenfre- 
quencies, A, and A,: 


mi, t+ns,=0 (m,n, integer) implies m=n=0. (234) 


The uniform distribution of the eigenfrequencies of U over the unit circle 
which is a consequence of the Koopman-property (44) is remarkable. As we have 
seen we can expect that except for singular cases the eigenfrequency spectrum, 
if not empty, will be dense on the unit circle. Whether the continuous spectrum 
of U has similar uniformity properties or not is not known, though from phys- 
ical reasons it seems likely to be the case. It seems plausible that the spectrum 
of a transformation U corresponding to macroscopic observables has no gaps. 

Consider an element x representing an observable and assume that the phase 
_ ales 

<a| a> 
function of the “stochastic process” x,=U"zr, and is an expression for the 
stochastic dependence between a and 2%+y 


average of a, e<e|x>, is equal to zero. Then o@(n) is the correlation 


e(0)=1, |o(n)|<1. (235) 


If in particular x is mixing the value of the correlation function o(n) tends to 
zero with increasing n. However, completely generally we have from the spectral 
resolution of U (197) the Cramér-Khinchin representation of 9 (7) 
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27 | Qa . oe | ap | BX 
o (n) = i exp {iAn} dF (A)= | exp {tAn}d, Sala (236) 
0 0 
The ergodicity of the element x can also be expressed by (compare (210)) 
1 n=N-1 
limi > O(n) =0. (237) 


It is easily seen that if the eigenfrequency spectrum of U is dense on the unit 
circle we can find elements # with an arbitrary prescribed form of the correla- 
tion function within any time interval of finite length. If further the absolutely 
continuous part of the spectral resolution of U has no gaps we can reject the 
requirement that the time interval be of finite length. In this sense we can ob- 
tain “approach to equilibrium’? within any required degree of approximation. 


Gaussian approximation 


In many cases of physical interest it may be admissible to assume that a 
stationary stochastic process x, =U" corresponding to a macroscopic observable 
is approximately Gaussian, though it may be very difficult to give a formal 
proof of the fact, except for very special cases. Under this assumption the pro- 
cess is completely determined by its correlation function 9(m) and in particular 
the stochastic process is metrically transitive (ergodic) if and only if the spectral 
distribution of o(n) (237) is continuous [Grenander, 1950]. It can easily be seen 
that for a Gaussian process the concepts metric transitivity and weak mixing 
are equivalent. Further, if the spectral distribution (237) is absolutely continuous, 
that is possesses a density, or more generally if 9 (n)—0 for n co, the Gaussian 
process will be strongly mixing. An elementary proof of the last propositions 
can be obtained by consideration of elements of the form 


y (k; w) = exp f k Tort (k real). (238) 


From the assumption about x it follows that 


<y (k’)|U" |y (k)> =exp {— 3 [? +k? +2kk' 9 (n)]}, (239) 
and hence if for n¢J (J of density zero), lim o(n)=0, (compare (214)), 
lim <y (k) |U" | y (k)> = <y (k’) |e> <el y (k)> (240) 
ned 


which expresses the fact that each y(k) is a weakly mixing element. It follows 
from the fact that the weakly mixing elements form a subspace that the sub- 
space generated by all finite products of the form 


[]U"y(k) — (m integer) (241) 


k 
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is contained in the subspace of weakly mixing elements. However, the elements 
of the form (241) generate the complete Hilbert space L*(Q, ) of the stochastic 
process and hence the process is weakly mixing according to the definition of 
[Hopf, 1937]. Similarly if lim 9(n)=0, without restriction, it follows that (240) 


n—-co 
holds without restriction, that is y(k) is a strongly mixing element for each k. 
Hence it follows as above that any element of the Hilbert space of the sto- 
chastic process is strongly mixing and the process is strongly mixing in the ter- 
minologi of [Hopf, 1937]. 
These results can be extended to multidimensional Gaussian processes without 
difficulty. 


APPENDIX 


Simplified derivation of the mean transition time 


Consider an incompressible stationary flow in the phase space Q. Let A be 
a phase region of finite measure w(A) and let J(A) be the total flow into A 
in the unit of time = total flow out of A in the unit of time (incompressibility). 
Those points which enter A in the interval dt are said to form the front (Gibbs’ 
terminology). The measure of the front is J(A)dt. We want to determine the 
mean of the times which the points of the front spend in A=mean transition 
time t(A). 

If in particular A is a tube of flow of finite length all points of the front 
have the same transition time t(A), if differences of the order dt are neglected, 
and we have the relation 


If A can be decomposed into several disjoint tubes of flow, w(A)= > m4 (Aj) in 


such a way that J(A)= > J(A,) we find 


U 


a 
c = yes 2 
eae STACY ad KOO REN oP 


L 


An arbitrary phase region A can be decomposed into two disjoint parts A’ and 
A” such that A’ is the sum of a number of tubes of flow as described above 
and A” consists of all the points in A which remain in A forever, that is 
J(A”’)=0. Hence we have for an arbitrary phase region 


mA Aa (Aa) A3 
se era JAp nae 2 ee 


where the second factor represents the “probability” that a point chosen “at 
random” in A will leave A within a finite time. 
It is possible to make this derivation rigorous. 
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